Connected Digits Recognition Task: ISTC–CNR Comparison of Open Source Tools
Authors
Abstract
EVALITA is a recent initiative devoted to the evaluation of Natural Language and Speech Processing tools for Italian. This work describes the results of three open source ASR toolkits: CSLU Speech Tools, CSLR SONIC, and CMU SPHINX. The three toolkits are applied to the EVALITA clean and noisy digits recognition tasks, and this report describes the complete evaluation methodology. CSLR SONIC achieved the best performance on all tasks, even with highly specialized training. We think this is mostly due to the PMVDR features used in this system. CMU SPHINX was the easiest system to train and test, and its overall performance is only slightly lower than that of SONIC. CSLU Speech Tools is the system most specialized for digit recognition, and its score falls between those of the other two. Overall, all three systems reach a Word Accuracy above 90%.
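The abstract reports its results as Word Accuracy without defining the measure. The following is a minimal sketch, assuming the common definition WA = (N - S - D - I) / N, where N is the number of reference words and S, D, I are the substitutions, deletions, and insertions in a minimum edit-distance alignment. The function names and the example digit strings are illustrative assumptions; they are not taken from the paper or from the scoring tools of any of the three toolkits.

```python
def align_counts(ref, hyp):
    """Count (substitutions, deletions, insertions) in a minimum
    edit-distance alignment of a hypothesis against a reference."""
    rows, cols = len(ref) + 1, len(hyp) + 1
    # Each cell holds (total_errors, S, D, I) for ref[:i] versus hyp[:j].
    table = [[None] * cols for _ in range(rows)]
    table[0][0] = (0, 0, 0, 0)
    for i in range(1, rows):
        table[i][0] = (i, 0, i, 0)      # only deletions
    for j in range(1, cols):
        table[0][j] = (j, 0, 0, j)      # only insertions
    for i in range(1, rows):
        for j in range(1, cols):
            e, s, d, n = table[i - 1][j - 1]
            if ref[i - 1] == hyp[j - 1]:
                diagonal = (e, s, d, n)          # match, no cost
            else:
                diagonal = (e + 1, s + 1, d, n)  # substitution
            e, s, d, n = table[i - 1][j]
            deletion = (e + 1, s, d + 1, n)
            e, s, d, n = table[i][j - 1]
            insertion = (e + 1, s, d, n + 1)
            # Tuples compare by total error count first.
            table[i][j] = min(diagonal, deletion, insertion)
    return table[-1][-1][1:]


def word_accuracy(ref, hyp):
    """Word Accuracy = (N - S - D - I) / N for one reference/hypothesis pair."""
    if not ref:
        raise ValueError("reference must contain at least one word")
    s, d, i = align_counts(ref, hyp)
    return (len(ref) - s - d - i) / len(ref)


# Made-up connected-digit strings, for illustration only.
reference = "uno due tre quattro".split()
hypothesis = "uno tre tre quattro cinque".split()
print(align_counts(reference, hypothesis))
print(word_accuracy(reference, hypothesis))
```

Because insertions are penalized, this measure can drop below zero for a hypothesis much longer than its reference, which is one reason it is usually reported over a whole test set rather than per utterance.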
Related articles
EVALITA-ISTC Comparison of Open Source Tools on Clean and Noisy Digits Recognition Tasks
EVALITA is a recent initiative devoted to the evaluation of Natural Language and Speech Processing tools for Italian. The general objective of EVALITA is to promote the development of language and speech technologies for the Italian language, providing a shared framework where different systems and approaches can be evaluated in a consistent manner. In this work the results of the e...
AKIRA: a Framework for MABS
Here we present AKIRA, a framework for Agent-based cognitive and social simulations. AKIRA is an open-source project, currently developed mainly at ISTC-CNR, that exploits state-of-the-art techniques and tools. It gives the programmer a number of facilities for building Agents at different levels of complexity (e.g. reactive, deliberative, layered). Here we describe the main architectural fea...
Designing and Implementing MABS in AKIRA
Here we present AKIRA, a framework for Agent-based cognitive and social simulations. AKIRA is an open-source project, currently developed mainly at ISTC-CNR, that exploits state-of-the-art techniques and tools. It gives the programmer a number of facilities for building Agents at different levels of complexity (e.g. reactive, deliberative, layered). Here we describe the main architectural fe...
Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
To facilitate the entry of data into the computer and its digitization, automatic recognition of printed texts and manuscripts is a considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
A Facial Animation Framework with Emotive/expressive Capabilities
LUCIA is an MPEG-4 facial animation system developed at ISTC-CNR. It works on standard Facial Animation Parameters and speaks with the Italian version of FESTIVAL TTS. To achieve an emotive/expressive talking head, LUCIA was built from real human data physically extracted by the ELITE optotracking movement analyzer. LUCIA can copy a real human by reproducing the movements of passive markers positio...